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On the asymptotic behaviour of the correlation measure of 
sum-of-digits function in base 2 

Jordan Emme* and Alexander Prikhod’kol 


Abstract 

Let S 2 {x) denote the number of digits “1” in a binary expansion of any a; € N. We study 
the mean distribution of the quantity S 2 (a; + a) — S 2 (a;) for a fixed positive integer a. It is 
shown that solutions of the equation 

S 2 (a; + a) — S 2 {x) = d 

are uniquely identified by a finite set of prefixes in {0,1}*, and that the probability distribution 
of differences d is given by an infinite product of matrices whose coefficients are operators of 

l\^)- 

Then, denoting by l{a) the number of patterns “01” in the binary expansion of a, we give 
the asymptotic behaviour of this probability distribution as I (a) goes to infinity as well as 
estimates of the variance of the probability measure 


Introduction 

In this article we are interested in the statistic behaviour of the difference of the number of digits 
1 in the binary expansion of an integer x before and after its summation with a. This kind of 
question can be linked with carry propagation problems developped in [5] and [9] and has to do 
with computer arithmetics as in [8] or [3] but our approach is different. Moreover, the function 
S 2 modulo 2 was extensively studied for its links with the Thue-Morse sequence (as we can find 
in 0) for example or for arithmetic reason as in [2] and [6]. But in this paper our motivation is 
not of the same nature and we will not look at S 2 mod 2 but just at the function S 2 which is the 
sum of the digits in base 2. 

Because of the simplicity of base 2 and of the summation process, we can give detailed 
behaviour of the correlation between the number of digits 1 in the binary expansion of x and 
X + a for any integer a. More precisely, we are able to compute the distribution of the difference 
S 2 {x + a) — S 2 (x) where S 2 {y) denotes the number of digits 1 in the binary expansion of any 
integer y. We essentially give two methods in Section 1. The first one, very elementary, just 
helps understanding the basics of our construction. The second that follows is a little bit more 
involved but allows us to present our result in a much more usable form for what interests us 
and what will be developped in the next section. 
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Section 2 is a technical part that allows us to understand the asymptotical behaviour of such 
a correlation as a gets bigger in a certain, non trivial sense. This section develops the idea 
that our correlation gets smaller as the number of subwords “01” in the binary expansion of a 
increases. To go further and develop this idea of increasing the number of patterns “01” in the 
binary expansion of a, we demonstrate a limit theorem for the random variable consisting of the 
difference of 1 in re and x + a for different a going to infinity in the previous sense. 

Let us now give some definitions and notations that will be used throughout this article. 

Definition 0.0.1. For any integer x G N whose binary expansion is given by : 

n 

x = '^Xk2’^, \fk e {0, XfcG{0,1}, 

fc=o 


we define the quantity 

n 

S2{x) = 

fc=o 

which is the number of digits 1 in the binary expansion of x. 

We will also define the following : 

Definition 0.0.2. For any integer x whose binary expansion is given by : 

n 

X = ^Xfc 2 ^, 
k=0 


we denote by x the word xo...Xn in {0,1}*. 

Remark 0.0.3. Since we are working in the free monoid {0, 1}* of binary words, let us remark 
that we will denote the cylinder set of a word w = wo...Wn by the standard notation [re] := {u G 
{0,1}^ I vo...Vn = wo-.-Wn}- These sets form a topological basis of clopen of {0,1}^ for the 
product topology. Moreover, we endow the set of binary configurations {0,1}^ with the natural 
probability measure P that is the balanced Bernouilli probability measure defined on the Borel 
sets. 

With that in mind, the equation we wish to study is the following, with parameters a G N 
and d G Z : 


S2{x + a) - S2{x) = d ( 1 ) 

More particularly, we wish to understand the behaviour, for any integer a, of the correlation 
between S2{x + a) and S 2 (x) given by : 


fia{d) := lim — 
N^OO iV 


#{x<N 


S2{x + a) - S2{x) = d} . 


Remark 0.0.4. Let us remark that for all integer a, this defines a probability measure on Z. 
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To this end, the first section will essentially be about the combinatorial description of the 
solutions of 0. We will describe the addition process via the construction of a particular 
tree. This will allow us to have an explicit formula for the desired distribution of probabilty 
only depending on the binary expansion of a as a product of matrices. The main result is the 
following : 

Theorem. The distribution is calculated via an infinite product of matrices whose coefficients 
are operators offi{T) applied to a vector whose coefficients are elements offiCZ) 

ha — {Id, Id) ■ ■ ■ 



where the sequence {an)nen is the binary expansion of a, (5o tde Dirac mass in 0 

To = 


Id iS-i 
- S' 


0 


( ^5 Id 


and S is the left shift transformation on fi{'L). 


Remark 0.0.5. We remind that the set of finite measures on Z is in bijection with the elements 
of fiifL). We will always identify finite measures on Z and elements of fi{'L). 


Such a result allows an analytical study of these distributions as the binary expansion of a is 
more and more complex. Let us introduce this notion of complexity for a : 


Definition 0.0.6. For any a G N, let us denote by l{a) the number of subwords 01 in the binary 
expansion of a. 


We are thus interested in what happens as there are more and more patterns 01 in the word 
a. We can precisely estimate the asymptotic behaviour of the /^(Z) norm of this distribution as 
a tends to infinity by increasing the number of subwords 01. 

Namely, the theorem is as follows : 

Theorem. There exists a real constant Cq such that for any integer a we have the following : 

Whah < Cq ■ l{a)~^/‘^. 


Finally, in the last section, we wish to obtain a much more precise result regarding the 
behaviour of such a distribution than just estimates of the 1“^ norm. So we study in which way 
the variance of the random variable of probability law pa is linked to the number of subwords 
01 in the binary expansion of a. We have bounds on this variance as shown in this result : 

Theorem. For any integer a such that l{a) is large enough, the variance —2V{a) of pa has 
bounds : 


l{a) - 1 < -2V{a) < 2{2l{a) + 1). 
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1 Statistics of binary sequences 


In this section we wish to understand the following quantity, for any given positive integer a and 
any integer d : 

fia{d) ■■= lim ^ #{x<N I S 2 {x + a) - S 2 {x) = d} . 

N-^oo JM 

We know from [T] that such a limit exists and has been studied in [7] for example, but we 
will give a proof using the structure of solutions of the following equation : 

S 2 (a; + a) - S 2 {x) = d. 


for any a and d. 

Let us investigate such solutions as well as their construction in order to understand the 
distribution of probability Ha of differences d = S 2 {x + a) — S 2 {x). We prove that this distribution 
is given by an infinite product of matrices whose sequence is given by the binary expansion of a. 

1.1 Combinatorial description of summation tree 

First we prove the following lemma : 

Lemma 1.1.1. For all a G N and d G Z, there exists a finite set of words Va,d = {pij ■■■iPk} C 
{0,1}* sueh that x is solution of 0 if and only if : 

k 

X G IJ [pj] 
i=i 

where [tc] denotes the cylinder set of configurations whose prefix is w for any word w G {0,1}*. 


Remark 1.1.2. Before we prove Lemma 1.1.1 let us remark that for any even integer n the 
following holds : 


,n. 


S2{n) = S2(-}. 


Proof. Let us prove this lemma by induction on a. It is obviously true that for a = 0 and a = 1, 
there exists a set of words for any d G Z (possibly empty) describing the solutions of Equation 
Let us assume it is true for every integer not greater than a given a G N. 

If a is even, then, for any d, let the set of words Va+i,d be {Ore,?/; G U {lw,w G 

Indeed, x G N being an even solution of S 2 (a: + a + 1) — 52 ( 2 ;) = d is equivalent to having 
S 2 {x + a) + 1 — S2(|) = d which can be written S2(f + f ) — ■S2(f ) = d — 1. From the induction 
hypothesis follows that x starts with a 0 followed by a word beginning by a word in 

Moreover, x G N being an odd solution of S 2 (x + a + 1) — S 2 {x) = d is equivalent to having 
S 2 (x — 1 + 2 + a) — (s 2 (^^) +1) = d which can be written S 2 (^^ +1 +1) — S 2 (^^) = d +1. Then 
we can state that x must start with a 1 followed by a word in V^j^i^d+i by induction hypothesis. 
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If a is odd, we have Va+i^ = {Ow, w G "Po+i U {Iw, w G "Po+i for any d. 

In fact, let X G N be an odd solution of S2(x + a + 1) — S2(x) = d. This is of course 
equivalent to having the equality S2(x + a) + 1 — (s2(^§^) + 1 ) = d which can be written 
S 2 (^^ + — S 2 (^^) = d. Moreover, x being an even solution of 52 ( 2 ^ + a + 1) — S 2 {x) = d 

is equivalent to having the equality S 2 (| + — S 2 (f) = d. These last equalities, with the 

induction hypothesis can be summarized as follows : x begins either by a 0 or a 1 and is followed 
by a word in "Po+i 

2 >“ 

□ 

From this lemma follows immediatly that for all positive integer a and integer d, the sequence 
(# {x < A^|s 2 (a; + a) — S 2 {x) = d})^gpj is a sum of sequences which contain an arithmetic pro¬ 
gression and thus the following limit exists : 

Ha{d) := lim ^ #{x<N \ S2ix + a) - S2{x) = d} . 

N^oo 

Moreover, a quick computation yields 

P&'Pa,d 


where P is the balanced Bernouilli probability measure on {0,1}^. 

The proof of Lemma 1.1.1 naturally gives the idea of an inductive way to compute the prefixes. 
A comfortable way to do that is to span, for every a, a tree Tq whose paths will represent the binary 
decomposition of some x and which will allow us to keep track of the quantity S 2 {x + a) — S 2 {x) 
as the summation goes. Let us state the following lemma : 


Lemma 1.1.3. For each a G N, there exists a tree Ta with vertices labeled in {0, ...,a} x Z and 
edges labelled in {0,1} such that for any d G Z, words in Va,d o,xe exactly paths from the vertex 
(a, 0) to a vertex (0,d). 

Proof. Let us choose a particular a in N and construct the associated tree Tq. The tree will 
have vertices labelled in {0,.., a} x Z and edges labelled in the binary alphabet {0,1}. This tree 
will also have its vertices on different levels ordered in N. We will construct our tree inductively 
on each level. 

If the tree is defined up to level n G N. From every vertex we construct the vertices at level 
n -|- 1 in the following way. 

Starting from a vertex labelled by {k,c) : 

• If /c is even we add a vertex at level n -|- 1 labelled by (|, c) and add one edge of each type 
between these vertices. 

• If however k is odd, we add two vertices labelled by c -|- 1) and c — 1) and we 
construct an edge of type 0 between {k,c) and c -|- 1) and an edge of type 1 between 
{k, c) and c — 1). 
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Figure 1: Infinite subtree spanned by every vertex labelled by 1 


The construction starts at level 0 with only one vertex labelled by (a, 0). 

Of course such a construction actually spans an infinite tree for we eventually build a vertex 
labelled by 1. Then, notice that a vertex 1 is parent of two vertices 0 and 1. Iterating the 
procedure builds an infinite subtree rooted in 1 as given in the example of figure 

The goal of this construction is then to read prefixes which are solutions of the equation as 
paths on this graph. Let us start at the vertex labelled (o, 0). The second coordinate of labels 
of vertices is a counter that keeps track of the difference 52 ( 2 ; + a) — S 2 {x) as the summation is 
processed digit by digit, and, in our case, that is represented by which vertex we go through as 
we follow X as a path in Ta starting from (a, 0). 

Starting from an odd vertex, if we follow the edge labelled by 0 then we add one to our 
counter and if we follow the edge labelled by 1 then we substract 1 to our counter. 

If however we start from an even vertex, the counter remains unchanged, regardless of which 
edge we follow. 

This allows to keep track of the difference S 2 {x + a) — S 2 (a^)- Once we reach a vertex (0,d), 
the summation is completed, so in order to find a prefix we only need to find a path from the 
vertex (a, 0) to a vertex (0, d) such that the counter reaches the desired d. 

For any a, the associated tree allows us to compute the set of prefixes Va,d for any d. Let 
us choose a positive integer x. We will follow the path associated to x starting from the vertex 
a. Then, on every level, following the path and keeping doing the indicated operations on our 
counter is just doing the summation digit by digit while keeping track of the 1 we have lost or 
added during this summation. If we are not careful, we might not end on a 0 (if the binary 
expansion of x is too short and/or a carry is still being propagated). However, adding enough 0 
to X ensures that we do not encounter this problem. 

This tree is then a convenient way to compute d = S 2 {x + a) — S 2 (x) and as such, to find the 
prefixes we are interested in as the different paths starting from (a, 0) and ending on a vertex 
(0,d). 
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□ 


We can see an example of such a construction on figure for a = 5. Vertices’ labels are 
boxed, edges labels are above them, the counter numbers are circled and, as a reminder, the 
increment or decrement we need to apply after each step are indicated on the edges. 

Example 1.1.4. Let a = 5. Then we start our construction on the first level of the tree by 
creating the root labeled by (5,0). The number 5 being odd, we add the vertices (2,1) and (3,-1) 
on the second level and add an edge of type 0 between vertices (2,1) and (5,0) and an edge of 
type 1 between vertices (3,-1) and (5,0). 

We continue the construction on third level by : 

• adding a child to vertex (2,1) labelled by (1,1) since 2 is even. 

• adding two children to vertex (3,-1) labbeled by vertices (1,0) and (2,-2) since 3 is odd. 
The edge between vertices (1,0) and (3,-1) is of type 0 and the edge between vertices (2,-2) 
and (3,-1) is of type 1. 

And we continue this process on each vertex on each level to obtain the tree of figure 



Let us compute the words in It can be read on the tree rs that this set is given by the 
words 00110, OHIO and 1010. 

We wish to remark that because of the way x is defined, the word 00110 for example is not 
the binary expansion of 6 but of 12 as everything is mirrored. 

Let us now remark that having such a family of trees proves the following proposition : 

Proposition 1.1.5. For every a inN and every d in Z, we have the following identities : 

h2a{d) = yLa{d) 
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and 


^^2a+l{d) = -Ha{d - 1) + -/Xa+i(d + 1). 

Proof. For any positive integer a and any integer d, we can read from the tree T 2 a that words in 
'P 2 a,d are exactly words beginning by either 0 or 1 and followed by a word in Va,d, thus we have : 

/i2a(d) = ^a(d) 

Notice that words in V 2 a+i,d are exactly words beginning by a 0 and followed by a word in Va,d-i 
and the ones beginning by a 1 followed by a word in Va+i,d+i- It follows that : 

fj-2a+lid) = ^fla{d - 1) + ^^a+l(d + 1). 


□ 


Remark 1.1.6. This proposition allows to compute explicitly any distribution since it is 

trivial to give explicit closed formulas for fiQ and //i. It also gives an understanding of the link 
between and the binary expansion of a. However, we prefer another presentation of such a 
result. Namely, one that assembles in a close formula these identities as we follows the binary 


decomposition of a. Such a formula is given in Theorem 1.2.1 


25 


12 


6 
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Figure 3: Collapsing T 25 . 


One way to do such a thing is to remark that if we collapse the tree Tq by identifying the 
vertices whose label’s first coordinate are the same, we obtain a graph whose structure is studied 
in the next section. 

Looking at this graph after collapsing is quite interesting for it gives us an understanding 
of the link between the summation process and the binary expansion of a. It also allows the 
statistical study of the behaviour of S 2 {x + a) — S 2 {x). An example of the tree T 25 collapsing is 
given in Figure 
















1.2 Distribution of S 2 {x + a) — S 2 {x) 

The goal of this section is to prove the following theorem : 

Theorem 1.2.1. The distribution is calculated via an infinite product of matrices whose 
coefficients are operators offi{7fi 

To, = {Id, Id) ■ ■ ■ • • • AaiAag 



where the sequence (an)neN is the binary expansion of a, do is the Dirac mass in 0 


^0 = 






and S is the left shift transformation on /^(Z). 


One can notice that only two different patterns can appear between two levels of the collapsed 
graph. Those patterns are given by figure 

Remark 1.2.2. Let us remark that there are always only two vertices on each level (except at 
level 0 where we can add the vertex a + 1 for coherence) labelled by two consecutive integers 
and that the order in which each pattern appears and how vertices are labelled are given by the 
binary expansion of a. 




o ^ 


2 

a,n 


'Ia,n+l 


Figure 4: The two different patterns encountered in the collapsed graph 


First, we begin by a definition : 

Definition 1.2.3. Let us define for each positive integer a and each level n of the collapsed graph 
the probability measures and on Z such that 'ni\^{d) is the probability that a path of 
length n starting from a in the associated collapsed graph ends on the vertex at level n with 
the counter value being d. Here we still endow {0,1}* with the balanced Bernouilli probability 
measure. 

This seems to only define a sequence in fiifL). Let us remind that we can identify finite 
measures on Z with sequences in f^(Z). It is a quick check to verify that we did define probability 
measures. 
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Lemma 1.2.4. The are given by induction : 


Vn G N, 


\'la,n+l/ 


ha,n 

Va,n 


where 


+ 0O 


a = 


Y,ak2^ 


k=0 


Proof. Let n G N, d G Z and a G Z+. We wish to compute rj^n+i han+i- is obvious that 
the pattern we see between levels n and n + 1 are given by a„. 

If = 0, then we encouter the left pattern of hgure[^ In this case we have : 


vin+lid) = vl,nid) + Ifhinid “ I)' 
This being true for all d, we can write : 


Va,n+1 Va,n 2 *^ iVa,n)^ 

where S is the left shift transformation on the space For the same pattern we also have : 

1 


ha,n+l{d) = 


which can be rewritten 


ha,n+l r\^iha,n) 


So in this particular case we can write the relations in the following way : 




7T^ 
la,n-\-l 


- S' 


Va,n 

Va,n 


The same arguments for the right pattern of figure (which corresponds to the case where 
an = 1) yields : 

'<n+l\ _ flS-^ 


„2 

la,n+l 


- S' 


0 

Id 


ha,n 

Va,n 


which ends the proof of the lemma. 


□ 


This lemma allows us to prove Theorem 1.2.1 


Proof. Let a G N and /Xa £ be the distribution of probability of the difference S 2 {x + a) — 
S 2 {x). From Lemma 1.1.1 we know that the set of solution for S 2 {x + a) — S 2 {x) = d for any 
d G Z is given by a finite set of prefixes. We can consider them to be all of size by adding 
prefixes if needed (which means partitionning our cylinder sets from Lemma 1.1.1 into smaller 
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cylinders of constant size We can also assume that is bigger that the highest n such 

that ttn = ^ with the same argument of adding prefixes (i.e. partitionning again). 

Then the probability fiaid) is given by since we need to end the summation on a 

vertex labelled by 0. Let us also notice that ~ ^ for otherwise we would have a solution 

prefix whose length would be greater than rud- With the previous lemma we thus have the 
following : 

^ia{d) = [{IdJd)Aa^^Aa^^_,---Aa,Aa, ) {d). 

It is also worth noticing that after rank rud keeping multiplying on the left by Aq will not 
have any influence on the final result so this allows for a more convenient writing of our result : 

fiaid) = ((/d, Id)--- Ao ) id) 

which proves the theorem. 

□ 


2 Asymptotic properties of distributions fia 

In this section we study the asymptotic behaviour of the family of distributions fia as I (a) 
increases. We prove that the norm of tends to 0 as l{a) increases hence the densities 

of fia tend to zero. 

Let us remind that I (a) denotes the number of distinct patterns 01 in the binary expansion 
of a. 

Remark 2.0.5. For any integer a there exists k integers pi, ...,pk such that a = 0^^or 
a = depending on a being even or odd. In the first case, k = 2l{a), and in the second, 

k = 2l{a) + 1. 

In other words, the number of distinct continous maximal blocks of digits in a is either 
2l{a) + 1 or 2l{a). 


2.1 Convergence to zero 

We start by proving the following theorem. 

Theorem 2.1.1. There exists a constant Cq such that for any 

\\ha\\2<Co-l{a)-^/\ 

The proof is established in a series of lemmas. Let us recall 


An = 


Id 

1 Q 
2 *^ 


0 


A = 


1 Q—1 
2*^ 

1 Q 
2“^ 


integer a we have the following : 


the notation 
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where S is the left shift on /^(Z). Let us define a norm on the space of n x n matrices Y = 
iyi,j)i,j£{i,...,nY by the following : 


l<j<n 

2=1 

Remark 2.1.2. Let us remark right away that this defines a submultiplicative norm. 
We consider the Fourier transform jua defined on the circle T by : 

V6» G [0, 27r), fia{0) = ^ 

dez 


Notice that applying the operator S to an element of /^(Z) transforms to the multiplication by e*^ 
of the fourier transform hence the linear operators become ordinary linear transformations 

in for any given 6 defined in the following way 

/f 0\ 

^oTO:=(„ l). 

Lemma 2.1.3. We have the following elementary inequalities for any 9: 

IlioWlli = ||ii(0)||i = ||io(0)iiWI|i = ||ii(0)io(0)||i = 1 and 


\\Ao{e)Aiie)Aom\i = ||ii(0)io(0)ii(0)||i := m 

Notice that f is strictly less than 1 except when 0 = 0. 
Proof. For any 6 G [0, 27r), we have : 


1 + \/5 + 4cos0 
4 


m -ie , 1 ip-2ie , i -w , i 

i„(e)ii(e)i„( 9 ) = (,/*!,» “ 

8^ f 4^ 


46 


and 


so he have 


i„-ie I i„-2ie lp-2ie 

Ai{ 9 )Aq{ 6 )Ai{ 9 ) = ( 2*0 I ip*0 I 1 l ie I 1 

,4^ ■'“ 8 ^ “'“4 2^ “'“4 


Po(0)ii(0)io(0)||i = Pi(0)AoW^iWlli. 


Moreover, the triangular inequality yields 


^ -^e 

+ 


< 

lg-2*0 

+ 

1 -ie 1 

+ T 

+ 


4 8 4 


4 8 


4 


8 4 


4 8 


so we have 



+ 


< 

1 ^2*0 

+ 

1 ie 1 

+ V 

4 8 4 


4 8 


4 


2 4 
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hence the sum of the modulus of the terms in the first column is less than the sum of the modulus 
of the terms of the second column, which implies that 


so finally 


||io(0)ii(0)Ao(0)||i = 




le-2^oj 

+ 

1 id 1 

+ 7 

4 


2 4 


1 + \/5 + 4 cos 9 


Lemma 2.1.4. For any k eW and any 9 G [0, 27r) 

||io(0) (ii(0))Mo(0)||i < \\Ao{e)A^{e)Ao{9)h. 

Proof. First let us state that, for all positive integer k : 

/ ^ — ikO P\ 


V0G[O,27r), 


where 




k ^-ije 


j=i 


2J 


We can compute that : 

va:gn, Ao{e) (Aiio))’^ Ao{e) = 


f^ + ^s,(9) 


+ ^Sk(9) + i 


Moreover, noticing that for every positive integer k and every 9 in [0, 27r), 


Sk+i{ 9 ) = —^Sk{ 9 ) + — 


implies that 


^-i{k+l)e g-j 0 

2fc+i + -i2^^k+i{0) 


□ 


+ 


(^) 


^-i{k+l)6 ^-2i0 1 

^ 2k+l + -^^k{0) + ^ 


+ 


2 ie 


\sk{0) + ^ 


Which means that the sum of modulus of coefficient on the first column of matrix 
Aq{9) {Ai{9))^~^^ Aq{9) is equal to the sum of the modulus of the coefficients of the second 
column of the matrix Ao{9) {Ai{9))^ Ao{9). Hence to prove that 

VfeGN, V0G[O,27r), \\Ao{9) {Ai{9))’^ Ao{9)\\i < \\Ao{9) Ai{9) Ao{9)\\i 
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suffices to actually show the following : 


VA:>2, V0G[O,27r), 


^—ikO 


^ + -^Ski9) 


+ 


„i9 


< PoWiiWioWlli. 


Using triangular inequality, we have 
VA;>2, V0E[O,27r), 


^—ikO 




+ 


je 


-Sk{0) 


< ^ + I'SA:(6')| 


which, still using triangular inequality, yields 
yk>2, V0G[O,27r), 

and 

ye E [0,27r), 

So, in any case, we get : 

yk>2, V0E[O,27r), 

which completes the proof since 


^—ikO 

e 



1 

g20 

1 

2k 

1 2 

+ 


- ^ + 

T 

+ 4 

,-^2e 




1 


1 

22 

+ 2 ^2(^) 

+ 


< p + 

T 

+ 4 


+ E 

j=3 


2J 


p—ik9 p—i9 





2fc + 2 

+ 


< 

T + 4 


1 

+ 4 


1 

T + 4 


Lemma 2.1.5. If —tt < 0 < tt then 


Hence for any positive integer N : 


+ - = Po(0)Ai(0)Ao(0)||i 


m < 




2<l|e 


with (j) as defined in 2.1.3\ 

The proof is left to the reader. 


□ 


Proof of theorem \2.1.1\ Recall that pLa = + ha ,2 is the sum of the components of the vector 

ha-, which is calculated as an infinite product 

^ . . . ^ai^ao ( ^0 


ha,2 
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Let us represent the binary expansion of a as a sequence of l{a) groups 11... 1 separated by 
zeros: 


a = 0™ill... lO^m ... 10.11... 1. 


We apply Lemma 2.1.4 to half of the patterns Oil..10 and use the bound of Lemma 2.1.3 


ioW (ii(0))"io(0)||i <</>(0). 


We can do this only on half of patterns Oil..10 to avoid problematic patterns like 
...0111011110 ..., since we need at least two “0” between two blocks on “1” to be able to 
apply Lemma 2.1.4 twice but we have only one “0” in between. Thus, we get 




l{a) — 1 


and hence, for each 9 G [0, 27r) and j G {1,2} 

and 


'V 

. 0 , 


_ -Ne'^115 

1 


\\^^a,j lb — 


1 


^a,ilb ^ 


1 


lN 


Finally, we get using lemma 2.1.5| 

ll^alb < V2 


1 /■“ 

b = -n= / e 

V 27r J—QQ 




\8ttN J 


15 


7r(/(a) - 1) 


1/4 




□ 


Remark 2.1.6. It follows directly from Theorem 2.1.1 that the density /iaO) —)• 0 as l{a) —)• oo. 

Let us also remark that this theorem actually gives an important information as to the fact 
that the probability measure Ha is linked with the complexity of the binary expansion of a, 
complexity which is measured by l{a). In fact, let us remark that for any integer n, /i 2 " = /ti so 
it is possible to find arbitrarily large a such that ||/ia|b does not tend to zero (actually whenever 
l{a) does not tend to infinity). 


2.2 Asymptotic mean and variance of /Tq 

Let us first recall the following : 


V0E[O,27r), Aoie) 



Am ■■= 


Ip-ie 

Iptd 


We begin by using Taylor’s expansion on the matrices ^o(^) and Ai{9) 


we have : 


Ak{9) — Ik 9 cxk 9“^ Pk 0(9^), k G {0, 1}, 
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where 


In = 


vO I 


9 

^1= 1 
V 1 


-h:'T 

:)■ 


1 fo V 

1 A 0^ 

^--i I 0, 


and we observe that the following relations hold 

( 1 , 1)4 = ( 1 , 1 ), 

1 


1 


(l,l)afc = (0,0), (l,l)/3o = --(0,l), (1,1)4 = --(1,0). 


( 2 ) 

( 3 ) 


Lemma 2.2.1. For all 0 G [0,27r), the product of matrices {Aq{ 0))^ converges as k goes to +oo. 
We denote the limit ^“(0) 


1 


-ie 


2-e»» 

0 0 


A^i0) = 

Proof. Remarking the following : 

V0g[O, 2A, (io(A)'^ = 

where 


1 Sk{6)\ 

0 ^ 


jje 


Sk{0) = e 


1=1 


2^ 


yields the lemma. 


□ 


Let us now remark that the asymptotic expansion of A^ is : 

ig"(0) =/oo+A/3oo + 0(A), 


where 


hcX) - 


A 4 
.0 0 , 


, /3cxd — 


'0 r 

.0 0 , 


Definition 2.2.2. Given a G N of binary length N, let us define na(6*) = A^Aajy(0) ■ ■ ■ ^ao(^)- 
Let D := {n G : ni + ^2 = 1}. 

Lemma 2.2.3. For any real vector uq G D the product (1,1) • na(6l) • vq has the following 
asymptotic expansion 

(l,l)-nAA-no = 1 + R(a) A + o(A), 
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where 

V(a) = ( 1 , 1 ) (/3ooVN+1 + /3a^VN + • • • + /3aofo) , 

and 

Vj G {1, N + 1}, Vj = Iaj_i ■ ■ ■ laoVO- 
Proof. Let us first recall that : 

(l,l)/0 = (l,l)/l = (l,l)/oo = (l,l). 

Moreover : 

N 

(1, 1) • na(0) • Vq = (1, 1) (loo + (/a; + Ckai^ + + 0{9^)) Vq. 

i=l 

Hence the constant term in the asymptotic expansion of (1,1) • na(0) • uq is 1 (since vq is in D). 
In addition, we have : 

(l,l)ao = = (0,0) 

so the term of degree 1 is 0. 

With the same argument, we can compute the quadratic term V{a). Notice that any coeffi¬ 
cient ao or ai is killed by left multiplication by (1,1). Hence the terms of the form : 

-loo laj-i •• --lai+i ^ai^ai—i ■ ■•hao 

disappear after the left multiplication by (1,1) and thus do not contribute in any way to the 
coefficient of the quadratic term. 

So the only terms contributing to V{a) in the product Aoo{9)Aaj.f{9)...AaQ{9) are the terms 
with coefficient /3o)/3i or /3oo- 

□ 


Lemma 2.2.4. For any j G {0,...,N}, the value {l,l)PajVj is estimated as follows 


1 




1 


1 


2 i . 




where is the length of a consecutive and maximal sequence of digits equal to aj to the left of j 
(including position j) in a, and bj is the length of the next block of identical digits to the left of 
the block containing the digit aj. 

Proof. Without loss of generality assume that Oj = 0. Observe that Iq and Ji contract the 
segment [(1,0), (0,1)] to the first and to the second point respectively with the contraction 


, 6 ? 


factor 2. Hence the block maps [(1, 0), (0,1)] into the segment [(2 '’j,( 1 — 2 ^i)),(0,1)]. 
bl-l 


Then the block 


contracts later to 
(l - 2“'’1+^ 


(1 - 2“''i),2“'’i+^(l - 2“'’i)), (l - 2“'’1+\2“^1+^) 


And multiplying on the left by (1, l)/?aj = (1, l)/3o = —^(0,1) means taking the second coordinate 
with an additional coefficient —1/2, and the lemma follows. 


□ 
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Remark 2.2.5. The same proof yields : 


(1) l)/5oo'*^Af+l ^ ,1 , • 

2 JV ^ 

Moreover, we have the following lemma : 

Lemma 2.2.6. 

^ 1 

Ka) < ^ < 2l{a) + 1. 

j=o 2 ^ 

Proof. This lemma is obtained by noticing that each block of identical digits in the binary 
expansion of a spans in ^ ^ partial sum of a geometric sequence of ratio 4 and first term 

^ so each is smaller than 1 and greater than In fact let us assume that there are k blocks in 
the binary expansion of a of lengths (i.e a = for example). Then 


N 


k li 


Ei = EE 

j=0 ^ ^ 1=1 j=l 




and 


fc ^ k k 


E)sEE 

i=l i=l j=l 

Moreover, k can be equal to either 2l{a) + 1 or 2l{a) which completes the proof. 


y 

^ i=l 


□ 


Theorem 2.2.7. For any integer a such that l{a) is large enough, the variance —2V{a) of fia 
has bounds : 


l{a) - 1 < -2V{a) < 2{2l{a) + 1). 

Proof. The idea of this theorem is to see that each block of same digits in the binary expansion 
of a has an impact on I^(a) estimated by a constant. 

Let us estimate the value of V(a) : 


N ^11 

-V{a) = -^(l,l)/3„.Uj - {l,l)/3^VN+i <'^-gr + 


j=0 j=0 

by Lemma 2.2.4 Moreover, the upper bound of Lemma 2.2. 6| yields: 

-2V(a) < 2(2/(a) + 2). 


2®i 2^^v-i' 


Now we will prove the lower bound : 

-Via) > 
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Let us remark that 


N N 

-v{a) = - ^(1, l)l3ajVj - (1, l)f5ooVN+i > - 

j=0 j=0 


so, using lemma (2.2.4), we have : 




j=0 


2 j V 2 J 


Hence, 


N N 

V{a) - „ 6 i+b 2 • 


j=0 ^ 2 j=0 ^ 2 ■' 


Let US denote by k the largest integer such that oq = ... = a^. Noticing that b'j = 0 for all 
j G {0,..., k} we have : 

N JV 

-V{a) > ^ Y1 

j=k+l ^ ■' j=k+l ^ ■' ■' 

Moreover, for any integer j larger than A;, 6^ > 1 hence : 


1 ^ 1 
■''(“) £ 5 E s 


from which we derive : 


2 2^j 

j=k+i ^ 


-H(a)>-(/(a)-l) 


by applying the lower bound of Lemma|2.2.6| . This proves Theorem 2.2.7 


□ 


On a final note, it is interesting to remark that the bounds for Theorem |2.2.7| might not be 
optimal but that for a sequence of integers (a(n))neN such that lim^^oo K“(^)) = +oo and such 
that the limit of exists, then this limit value is in [1,4]. We give some examples of 

computer simulations in Figure]^ 

It would be interesting to understand the asymptotic behaviour of the ratio • Having 

a necessary condition for convergence and a precise idea of how it behaves asymptotically would 
help understanding the variance of the probability measure [la- 
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3.5 


2.5 



0 200 400 600 800 1000 

(a) ^ST^poura(n) = ELo2^'' 



(c) pour a{n) = ELo 2®'' + 2®'=+^ + 2«'=+2 


2.5 



0 200 400 600 800 1000 


( b ) = ^^0 2 "'' + 2 ^'=+^ 



(d) =|^POur^=10l20^..1 


Figure 5: Asymptotics of the ratio for different sequences {a{n))n£n 
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